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ABSTRACT 

We have searched the hybrid BALQSO catalogue of Scaringi et al. derived from 
data release 5 of the Sloan Digital Sky Survey in order to compile the largest sample 
of objects displaying spectral signatures which may be indicative of radiative line 
driving. The feature in question is the "ghost of Ly-a" , a line- locking feature previously 
identified in the broad C IV and Si IV absorption lines of a small fraction of BALQSOs, 
and formed via the interaction of Ly-a photons with N v ions. 

We test, where possible the criteria required to produce an observable ghost fea- 
ture. These criteria include: significant broad absorption, strong intrinsic Ly-a emis- 
sion, narrow Ly-a, strong N v absorption, and a weak far-UV continuum. No single 
ghost-candidate meets all of these criteria. Furthermore, we find that these criteria 
are not met significantly more often in ghost-candidates than in a comparison sam- 
ple chosen to exhibit relatively featureless broad absorption troughs. Indeed, the only 
significant differences we find between our ghost-candidate and comparison samples, 
is that on average, our ghost-candidate sample displays (i) significantly stronger N v 
absorption, and (ii) the onset of absorption occurs at lower velocities in our ghost- 
candidate objects. 

Significantly, we find no evidence for an excess of objects whose absorption troughs 
bracket the location of the Ly-a-Nv line-locking region, rather the location of ghost- 
like features appears to be independent of any systematic velocity, with comparable 
numbers appearing both redward and blueward of the ghost-zone. Thus, the majority 
of objects identified here as strong ghost-candidates are likely multi-trough interlopers 
whose absorption feature simply bracket the region of interest. 
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1 INTRODUCTION 

Broad absorption line quasars (BALQSOs) as their name 
suggests, show strong broad blue-shifted absorption lines 
in their spectra believed to be indicative of high veloc- 
ity (~0.1c) out-flowing winds. BAL QSOs represent approx- 



Trump et. all 120061: iKniEEe et alj 120081; IScarinei et. al.1 


20091). The majority 


~ 85%) (Spravberrv and Foltzl 19921: 


Reichard et. al.1 2003b 


) of BALQSOs are known as HiBALs, 



displaying absorption in lines of high ionisation only (e.g. 
N v, Si IV, and C iv). The remainder are classified as LoB- 
ALs and show in addition broad absorption in lines of low 
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ionisation species, most notably Al ill and Mg II. LoBALs 
are further sub-classified according to the presence of Fe ab- 
sorption, the Fe LoBals. Sinc e the spectra o f LoB als are in 
general redder than HiBals, iBecker et. al.l (|2000l ) suggests 
that BALQSOs and in particular Fe LoBALs may be an 
early phase in the development of emerging or re-fuelled 
quasars. 

There has been much debate about the relationship 
between BALQSOs and the general quasar population as 
a whole. Simple unification schemes, suggest that BALQ- 
SOs and non-BALQSOs are similar objects and that any 
observed differe nces in the i r spectra arise due to ori- 
entation effects (lOgle et. all [l999l: IWevmann et. all Il99ll : 
ISchmidt and Hinesl 1 19991 : lElvisI I2OO0I ) . In this picture, the 
relative fraction of BALQSOs to non-BALQSOs has a sim- 
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pie geometric interpretation, representing the fraction of sky, 
as seen from the source, obscured by gas (ie. the source cov- 
ering fraction), which in the simplest case, can be related to 
the flow geometry (e.g. opening angle). 

In recent years interest in BALQSOs outflows has risen 
sharply, principally because of the realisation that such high 
velocity outflows carry a substantial amount of energy and 
momentum into the ISM, and may therefore be important in 
driving AGN feedback as w ell as providing a mechanism for 
quen ching star formation |Scannapieco. Silk and Bouwens! 
120051 ). Indeed, the discovery of highly ionised, very high- 
velocity X-ray outflows (e.g. Pounds et al. 2003a, 2003b; 
Pounds and Reeves 2009), for which the energy transport 
(in terms of mechanical energy) is large enough to inter- 
rupt the growth of the host galaxy, may provide the causal 
link behind the well-known correlation between the mass of 
the central black hole and the mass of the bulge (e.g. Fer- 
rarese and Merritt 2000; Gebhardt et al. 2000; Tremaine 
et al. 2002). 

However, despite the increasing importance of 
these outflows the precise mechanism responsible for 
accelerating them to high velo city remains uncer- 
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the main contenders (see e.g. IProgal (2007) for a review) 
A major hindrance to progress in this area is the absence 
of a clean discriminatory observational signature of what 
are essentially orthogonal wind geometries. For radio-quiet 
objects, AGN unification schemes tend to favour equatorial 
wind geometries (e.g. Elvis 20 00). By contrast, o b serva- 
tions of radio-loud BA LQSOs (|Becker et. all Il997l . l200d ; 
iBrotherton et. aT, I 1 19981 ) and in particular polarisation 
observations o f PKS 0040-005 indica te a non-equatorial 
BAL outflow (|Brotherton et. all [2006). One possible ex- 
planation for the apparent difference in wind geometry 
between radio-loud and radio-quiet objects, is that different 
acceleration mechanisms operate in these two classes of 
objects. 

Perhaps the strongest indicator that radiative driving 
is responsible for accelerating at least some of the high ve- 
locity flows, is the appearance of line-locking features in the 
spectra of a small fraction of BAL quasars (e.g. Turnshek 
et al. 1988, Weymann et al. 1991, Korista et al. 1993, Arav 
1996, Vilkoviskij and Irwin 2001). The most well-studied of 
these is the so-called ghost of Ly-a, a small hump seen in the 
absorption troughs of a small fraction (less than a few per- 
cent) of BALQSOs formed via the interaction between Ly-a 
photons and N v ions (see e.g. Korista et al. 1993, Arav 
et al. 1995, Arav 1996, and references therein). However, 
the reality of ghosts remains very much an open question. 
Previous studies have been limited to small samples of rel- 
atively low quality spectra, with strong ghosts often only 
becoming apparent in composite spectra (Arav et al. 1995, 
North et al. 2006). Moreover, distinguishing between line- 
locking features and similar features caused by the chance 
alignment of multiple absorption systems is difficult with 
relatively small samples. Indeed, Korista et al. 1993, showed 



that in a sample 72 objects, evidence for line-locking features 
was merely suggestive rather than convincing. 

The aim of this paper is twofold: (i) to compile the 
largest sample of uniformly selected ghost-candidate spec- 
tra, and (ii) attempt to determine the origin of their ghost 
features, by testing on a source by source basis, whether they 
meet the original criteria as set out by Arav (1995) neces- 
sary for ghost-features to be observed. In §2 we describe the 
mechanism proposed for ghost formation and outline Arav's 
criteria for the production of strong ghost signatures. Selec- 
tion of those objects comprising our ghost-candidate spectra 
and our non-ghost control sample is described in §3. In §4 
we present the results of testing each of the spectra in our 
ghost-candidate and comparison samples against each of the 
criteria in turn necessary for the formation of an observable 
ghost feature. We discuss the implications of our findings in 
85. Our conclusions are summarised in 56. 



2 THE GHOST OF LY-a 

The feature that is poetically known as the ghost of Ly-a 
is a hump seen in the absorption trough of a small frac- 
tion (less than 5%, Arav 1995, North et al. 2006) of BALQ- 
SOs located 5900 km s _1 blue-ward of the centre of the 
line to which the ab sorption is attr i buted . The feature was 
first investigated by iKorista et. al.l (1 19931 ) after being seen 
in a difference spectrum of BALQS O and non-BALQSO 
spectral composites produced by IWevmann et. alj (|l99ll ) 
who pointed out that the difference between the two ab- 
sorption troughs was approximately equal to the velocity 
separatio n of N v and Ly-q. In a series of papers i n the 
mid 90's jArav and "LI 1 19941; [Arav. Li and Begelmanl Il994l ; 



lArav and Begelmanl Il994l ; lArav et. all Il995l ; lAravl fl99rl ) 
Arav suggested that the feature seen at this velocity was a 
result of the increased radiative pressure on N v ions at this 
velocity within the outflow. In this picture N v ions mov- 
ing at 5900 km s _1 'see' the Ly-a emission from the AGN at 
the energy of their own emission resulting in a large increase 
in the scattering cross section of the Ly-a photons. Conse- 
quently, out-flowing N v ions receive an increase in radiation 
pressure leading to a large injection of momentum into the 
outflow. The out-flowing ions of other species are dragged 
to higher velocities by electromagnetic interactions with the 
N v ions, leading to a deficit of material at this velocity 
(because each out-flowing ion spends very little time at this 
velocity before being accelerated to greater velocities). In 
turn this causes a decrease in the opacity at this velocity 
which manifests as a peak within the absorption troughs of 
the out-flowing species. 

This simple mechanism explains both the presence of 
the feature and why it 's pos ition is linked to the rest frame 
of the source FI lAravl (l996) suggested that the reason for 
the feature being most prominent in C IV absorption rather 
than in other species, is that the doublet separation of C IV is 
far smaller (498 km s _1 ). By comparison the larger doublet 
separations of Si IV (1933 km s _1 ) and O VI (1647 km s _1 ) 



1 This is not the case for shadowing of N V by Ly-a 
llKorista et. al.lll993fi where the separation of the troughs is con- 
stant but their velocities are determined by the velocity of the 
first trough. 
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result in the feature being blurred over several thousand 
km s _1 , producing a low contrast feature which is difficult 
to detect in low S/N data. For this reason, in this study 
we confine our search for ghost features to the C IV broad 
absorption line only. 

Since Ly-cv and N v are amongst the strongest UV lines 
in AGN spectra, an obvious question to ask is why aren't 
ghost-features ubiquitous amongst the general BALQSO 
population? To answer this, Arav proposed a set of phys- 
ically motivated criteria which had to be met before ghost 
features would be observable. These are: 

(i) Significant broad absorption in the region between 
3000 and 9000 km s _1 blue- ward of line centre, 

(ii) Strong intrinsic Ly-a emission, 

(iii) Narrow Ly-a emission, 

(iv) Strong broad N v absorption, 

(v) Little far-UV flux between between A200A and 
AlOOOA. 

The first criterion is simply a statement that ghost-features 
can only be observed if significant photon scattering is oc- 
curring. The second criterion implies that there must be a 
significant flux of Ly-a photons, while the third requires that 
the line-emitting gas has a relatively narrow spread in veloc- 
ities so that the resulting feature is not spread over a large 
range of velocity reducing the contrast. Since the dominant 
scattering ion is N v, strong N v absorption is also needed. 
The final requirement, and the most difficult to measure, 
recognises that in order for Ly-a-N v line-locking to dom- 
inate the dynamics of the flow, there cannot be significant 
far-UV line driving (Korista et al. 1993). 



3 SELECTION OF GHOST-CANDIDATE 
SPECTRA 

BALQSOs represent ~15% of the quasar population (see e.g. 
Knigge et al. 2008 and references therein), of these, between 
~20-25% show evidence for multiple troughs (|Korista et. al.l 
ll993l ; lNorth et. al.l2006T ). Amongst the multi-trough objects, 
ghost candidates are likely rare representing less than a few 
per cent of BALQSOs (Arav 1995) because of the stringent 
requirements for ghost observability. Thus in order to find 
significant numbers of ghost-candidates, large quasar sam- 
ples are required. 

All previous ghost candi date samples have been limite d 
to just a handful of objects l|North et. al.ll2006l ; lAravlll996tl . 
By comparison, Data Release 5 (DR5) of the Sloan Digital 
Sky Survey (SDSS) contains 77,429 quasars of which 28,421 
have redshifts between 1.7 and 4.2 allowing C IV and any 
associated absorption to be seen in their optical spectra. 
We require a visible C IV BAL because although radiative 
line driving will produce a feature in all absorption troughs, 
the resultant hump will be most easily observed in the C IV 
absorption trough because of the small doublet separation 
of this line. Several BALQSO samples hav e been compiled 
from the various data relea s es of the SPSS dReichard et. al.l 
l2003al ; pTrump et. all 120061 ; iKnigge et all I2008T) and from 
DR5 (|Gibson et. all 120081 ; IScaringi et. all 120091). He re we 
adopt the BALQSO catalogue of Scaringi et. al.l ((2009), con- 
taining 3,552 BALQSOs. The Scaringi et al. sample, differs 




Figure 1. Top-Panel: Geometric mean composite spectrum of 
objects (1019) rejected due to having multiple trough features 
(solid line) fit with a reddened DR5 QSO composite (dashed line). 
Middle and lower panels: example spectra of individual objects 
rejected at this stage. The dashed vertical lines indicates the lo- 
cation of the ghost-zone. 



from other samples in that is does not rely on simple met- 
rics such as the Balnicity index (BI, Weymann et al. 1991) 
or Absorption index (Al, Hall et al. 2002, Trump et al. 2006) 
to identify and classify BALQSOs. Rather it uses a robust 
hybrid method, involving gross classification with a super- 
vised neural network (learning vector quantisation) and the 
visual inspection of outliers. 

To identify strong ghost candidates we have performed 
five cuts on this BALQSO sample. The first cut eliminates 
noisy spectra (S/N<4 per pixel), estimated in two line- free 
continuum bands from AA1650-1700A and AA1700-1750A. 
To reduce contamination from emission, absorption and cos- 
mic rays that may be present in either bin we take the 
greater of the two values as our S/N estimate. The sec- 
ond cut removes all objects which show an apparently sin- 
gle smooth absorption trough. Following this cut we are left 
with only those objects which display multiple absorption 
trough features (1019 in total)Q 

A third rejection cut removes those objects that con- 
tain 2 or more peaks within the absorption trough (num- 
bering 761). These objects are eliminated to reduce contam- 
ination by objects which display absorption from multiple 
out-flowing regions which by chance happen to lie at veloc- 
ities corresponding to the ghost-zone. This leaves just 258 



2 Approximately 28% (1019 out of 3552) of our BALQSOs show 
evidence for multiple troughs. This compares favourably with pre- 
vious studies by North et al. (2006) who found that 58 out of 224 
(26%) of BALQSOs from SDSS EDR showed evidence of multiple- 
troughs, and Korista et al. (1993) who found 16 out of 72 objects 
(22%) showed multiple-troughs. 
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Figure 2. Top-Panel: Geometric mean composite spectrum of 
objects (189) rejected due to having double-trough, ghost-like fea- 
tures, that appear outside of the ghost zone (solid line) fit with a 
reddened DR5 composite (dashed line). Middle and lower panels: 
example spectra of individual objects rejected at this stage. The 
dashed vertical lines indicate the location of the ghost-zone. 



Figure 3. Top-Panel: Geometric mean composite spectrum of 
objects (26) rejected due to having insufficient absorption either 
side of the ghost feature. Middle and lower panels: example spec- 
tra of individual objects rejected at this stage. The dashed vertical 
lines indicate the location of the ghost-zone. 



objects which contain a single-peak (or alternatively double- 
trough) within the C IV absorption region. 

Figure [T] (upper panel) shows our multi-trough compos- 
ite spectrum (solid line) created from taking the geometric 
mean of all objects rejected at this stage after normalising 
the flux between AA1700-1750A to unity. In order to high- 
light the absorption regions, we also plot a non-BALQSO 
composite spectrum (dashed line). The non-BALQSO com- 
posite is the geometric mean of all of the quasar spectra 
in the SDSS DR5 quasar catalogue in the redshift range 
1.7<z<4.2 excluding those objects which are also in the 
BALQSO catalogue of Scaringi et al., and reddened to fit 
to the continuum windows in the multi-tr ough comp osite 
spectrum using the SMC extinction law of iPeil (1992). For 
completeness, the middle and lower panels show two exam- 
ple spectra rejected at his stage. We note that our multi- 
trough composite spectrum [T] shows no evidence for coher- 
ent structure within the C IV trough. This suggests that the 
positions of the various absorption systems in the individ- 
ual spectra are in general uncorrelated, that is they do not 
contain a large number of ghost candidates that happen to 
show additional peaks within the absorption. This is consis- 
tent with the hypothesis that these multiple-trough spectra 
are the result of multiple unconnected absorbing systems. 

For the 258 objects which show a single peak in their 
C IV absorption we make a further cut to remove those ob- 
jects (189) which show p eaks outside of the ghost-zone as 
defined by iNorth et. al] (2006). The ghost zone edges are 
5900 km s _1 blue- ward of the peaks of the doublet of the 
emission line in question (in this work C iv) this zone is then 
expanded to take into account redshift errors by multiplying 



these boundaries by l±Az/(l+2 TOe d), where z me d is the me- 
dian redshift of our sample (z me d =2.14), and Az is the av- 
erage redshift error (Az =0.01). This results in a ghost zone 
extending between A1513.2 A and A1525.4 A for C iv. Our 
single-peak composite comprising 189 objects whose peak 
lies outside of the ghost-zone is shown in Figure [5] (upper 
panel, solid line). The middle and lower panels show example 
spectra of objects rejected at this stage. Each object shows 
a clear peak located just outside of the limits of the ghost 
zone. Interestingly, this composite spectrum shows some ev- 
idence of a feature at the low velocity edge of the ghost 
zone. One possible explanation for this feature is that these 
objects are genuine ghost-candidates with poorly assigned 
redshifts. However, on closer inspection of the objects re- 
jected at this stage we find no evidence for features outside 
of the ghost zone due to random redshift errors. Alterna- 
tive explanations for the origin of this feature include, (i) 
intrinsically weaker absorption at lower velocities, or (ii) an 
abrupt change in the intensity of the (overlying) emission at 
these velocities. 

We make one further cut to remove those objects which 
do not show significant broad absorption either side of the 
ghost feature extending from 3000 to 9000 km s _1 blue- ward 
of the emission-line. This cut is chosen to eliminate objects 
with deep narrow absorption features whose alignment mim- 
ics the presence of a ghost feature. Figure [3] (upper panel, 
solid line) displays the composite spectrum of all objects 
rejected at this stage, while in the middle and lower pan- 
els, we give examples of individual objects rejected by this 
cut. Though the composite spectrum shows a clear feature 
within the ghost zone, it is not smooth, instead displaying 
sharp narrow features, whose separation is larger than the 
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Figure 4. Top-Panel: Geometric mean composite spectrum of our 
ghost-candidate sample of 43 objects. Middle and lower panels: 
example spectra of individual objects making up this sample. The 
dashed vertical lines indicate the location of the ghost-zone. 



doublet separation of C IV. This reflects both the narrowness 
of the absorption features (since intrinsically broad humps 
would be smoothed out in the averaging process yielding 
low contrast features) which make up this composite, and in 
addition small differences in their velocities relative to the 
rest-frame of the source. These differences may result from 
these features being unrelated to the ghost mechanism and 
appearing in the ghost zone by chance though we cannot 
preclude the fact that some of the objects rejected at this 
stage are indeed true ghost-candidates. 

Our final sample of 43 ghost-candidate objects com- 
prises the largest sample of objects exhibiting the ghost of 
Ly-a and the first to allow a statistical analysis of the prop- 
erties of these objects. Figure U shows the geometric mean 
composite ghost-candidate spectrum (solid line), together 
with a non-BALQSO composite spectrum (dashed line), and 
two example ghost-candidate spectra (middle and lower pan- 
els). Our final ghost-candidate composite spectrum displays 
a clear smooth double-trough structure located firmly within 
the ghost zone of a relatively broad absorption trough. Vi- 
sual comparison of this composite with that formed from 
objects rejected at the previous stage (i.e. those with insuffi- 
cient absorption either side of the ghost-zone), indicates that 
our final sample displays on average narrower Ly-a and C IV 
emission-lines, stronger Ly-a emission and deeper N v ab- 
sorption (fulfilling 3 of Arav's criteria for forming observable 
ghost features), as well as broader Si IV absorption, and sug- 
gests that we are indeed isolating those objects most likely 
to form observable ghost features. Figure [5] summarises the 
various cuts made in the creation of our ghost-candidate 
sample. 
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Figure 5. Flow diagram showing the number of objects rejected 
at each step in our ghost selection algorithm. 

3.1 Selecting a Comparison Sample 

Of the 43 strong ghost-candidates, 21 are at large enough 
redshift z > 2.15 to place Ly-a above the atmospheric cut-off 
and thus allow us to test for each object all of the criteria 
deemed necessary for ghost-formation. These are listed in 
the upper half of Table 1. For the remainder, their redshifts 
are too low to allow a direct measurement of the strength 
and width of the Ly-a emission-line. For these objects we 
estimate the strength and width of the Ly-a emission-line 
using the C IV emission-line as a surrogate, and substanti- 
ated by known correlations between the two lines (see e.g. 
Wilkes 1984, Ulrich 1989). These objects are listed in the 
lower half of Table 1. For completeness, we have also com- 
piled a sample of BALQSOs showing no evidence for a ghost 
feature or multiple trough. This allows us to compare the rel- 
ative frequency with which the ghost-formation criteria are 
met amongst BALQSOs both with and without ghost fea- 
tures. In order to create s uch a sample w e have performed 
a number of cuts on the IScaringi et. ail i|2009l ) BALQSO 
sample. As for our ghost-candidate sample we first select 
for S/N>4, and then select those objects with z > 2.15 to 
allow inspection of the N v and Ly-a emission-lines neces- 
sary to test for the presence of strong narrow Ly-a emission 
and strong N v absorption. The remaining spectra are then 
visually inspected and any object showing multiple absorp- 
tion troughs or any other hint of a ghost feature are removed 
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from the sample. We also remove those objects which do not 
show significant absorption between 3000 and 9000 km s _1 . 
This leaves just 26 objects showing significant C IV absorp- 
tion between 3000 and 9000 km s" 1 with no evidence for a 
ghost feature or multiple trough. One of these objects SDSS 
J101056. 68+355833. 3 shows evidence for a ghost feature in 
the Si IV absorption line (see A5) and is therefore also re- 
moved from our comparison sample leaving just 25 objects. 
A summary of the rejection cuts made in the creation of 
our comparison sample is given in Figure [6] The geomet- 
ric mean composite spectrum of our comparison sample is 
indicated by the dotted line in Figure 7. Aside from substan- 
tially weaker N V absorption, the strengths and widths of 
the emission-lines in the comparison sample composite spec- 
trum match those of the ghost candidate composite spec- 
trum remarkably well. The main difference between these 
two composite spectra appears to be the absence of the low- 
velocity absorption trough in the comparison sample spec- 
trum. We do not believe this difference arises from the way 
in which the two samples were selected. 

In §4 we use our ghost-candi date sample to test, where 
possible, the criteria set out by lAravl (|l996l ) required by 
radiative acceleration models to produce an observable ghost 
feature. We compare the relative frequency by which our 
ghost candidate spectra meet the criteria necessary for the 
formation of observable ghost features with our non-ghost 
comparison sample. 



4 TESTING THE CRITERIA FOR THE 
FORMATION OF OBSERVABLE GHOST 
FEATURES 

In this section we present the results of testing the crite- 
ria for the formation of observable ghost features. We first 
concentrate on the sample of 21 ghost-candidate objects for 
which all of the criteria may be tested and compare these 
results with those obtained using our non-ghost comparison 
sample. We then move on to our remaining objects and test 
where possible whether these objects satisfy the necessary 
criteria. 

4.1 Significant C IV broad absorption line 

All of the objects in our ghost candidate sample will clearly 
show significant absorption in the region between 3000 and 
9000 km s _1 blue- ward of the C IV emission line as this is 
required in order to see any potential ghost feature. This 
criterion is required purely because in order to effect the 
out-flowing C IV ions in an observable way this outflow must 
reach significant optical depth and appear as a BAL trough 
within the spectrum. Our comparison sample has also been 
chosen to exhibit strong broad absorption, so naturally also 
fulfils this criterion. 

4.2 Strong Intrinsic Ly-a emission 

In order for the effects of Ly-a radiation on N v to cause 
a significant increase in the radiation pressure on the out- 
flow as a whole the Ly-a emission line must be intrinsically 
strong. However, since the ghost of Ly-a is produced by 
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Figure 6. Flow diagram showing the number of objects rejected 
at each step in our comparison sample selection algorithm. 

scattering Ly-a photons it can be extremely difficult to mea- 
sure the str e ngth of the intrinsic Ly-a emission- line. Hence 
lArav et. all (| 19951 ) modified this criterion to allow the C IV 
and N v emission-lines to be used in situations where the Ly- 
a emission-line was either significantly absorbed, or had in- 
sufficient spectral coverage. This modified criterion requires 
Ly-a emission to be at least 300% above the continuum 
and/or both C IV and N v to be at least 100% above the 
continuurrjf] . They note that 'the latter criterion is hardly 
ideal but observations of non-BAL quasars show that the 
line strengths roughly scale together' (Arav et al. 1995). 
Here we isolate the continuum emission using the SPECFIT 
package in iraf. We model the continuum as a reddened 
power law and fit to the continuum in regions free of con- 
taminating emission- and absorption- lines, where possible. 
This can be particularly difficult for objects with the highest 

3 We note that measuring the strength of a line according to 
its peak flux can be misleading, since it implies that the under- 
lying line profiles are similar for all objects (and thus the total 
flux in the line simply scales with the peak flux). For example, 
a comparison between the C IV emission-line of the composite 
spectra in Figure 7 suggests that both our ghost-candidate and 
comparison samples display a narrower underlying C IV emission- 
line. If the widths of the other emission-lines scale with the C IV 
emission-line width, then we would expect both the Ly-o and N v 
emission-lines to be similarly narrow. 
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Figure 7. Comparison between the geometric mean composites 
formed from (i) 21 ghost-candidates with z > 2.15 (solid line), (ii) 
25 BALs with featureless broad absorption troughs (dotted line), 
and (iii) non-BALQSOs from the DR5 QSO catalogue (dashed 
line). 



velocity out-flows as there are few emission/absorption line- 
free regions. Furthermore, we have not attempted to fit for 
the continuum short-ward of A1280A as this region is typ- 
ically so absorbed and the line-emission so highly blended 
that no clean continuum regions can be identified. Instead, 
we fix the continuum strength at short wavelengths to that 
measured between AA1315-1350A. This approach is chosen 
to avoid overestimating the strength of the broad absorp- 
tion, at the expense of overestimating the strength of the 
line-emission in a few cases. 

In objects whose spectra cover the Ly-a region, 86% 
(18/21) of the ghost candidate sample meet the criterion for 
strong emission-lines compared to 88% (22/25) of the com- 
parison sample. Inspection of the composite spectra for the 
two samples (Figure[7|), indicates that the strength of the Ly- 
a emission-line (relative to the continuum) is broadly simi- 
lar in the two samples, and both are weaker relative to the 
non-BALQSO composite, due to the presence of strong N v 
and Ly-a absorption. For the other lines, the N v emission- 
line appears stronger on average in our ghost-candidate and 
comparison samples relative to non-BALQSOs, while there 
appears little difference in the strength of the C IV emission- 
line, though we note that in so far as we can measure them, 
their emission-line widths appear to be narrower on average. 

4.3 Narrow emission-lines 

Models of outflows accelerated b y radi at ive line drivi n g pro - 
duced by lArav and Begelmanl j 19941 ); lArav et. all <|1995t l 
suggest that once the line widths exceed 3500 km s _1 
(FWHM) potential ghost features become undetectable. 
This is because broader emission-lines will result in a 



broader, lower contrast feature (since the core of the line, 
where most of the scattering occurs, is weaker relative to 
the wings in a broad emission-line c.f a narrow emission- 
line) which is difficult to detect particularly in low S/N 
data. Unfortunately, determining the width of the under- 
lying emission-lines in BALQSOs is extremely difficult as 
the broad absorption in these objects prevents accurate 
measurements of the shape and strength of the underly- 
ing emission-lines and continuum. Given how our ghost- 
candidate sample was selected we expect all of the ghost- 
candidates to show significant absorption in the blue wing 
of Ly-a, N v and C IV, while our comparison sample was 
chosen to exhibit strong relatively featureless absorption in 
the blue wing of C IV. 

Since we cannot make an accurate measurement of the 
width of the Ly-a emission-line we instead attempt to es- 
timate its likely width using the C IV emission-line width 
as a surrogate. For the majority of sources, C IV is severely 
absorbed, so we estimate its' underlying un-absorbed width 
using model fits to the red wing of the emission-line. We use 
the same underlying fit for the continuum as was used to 
measure the emission- line strength (§4.2). We fit the red- 
wing with a two-component model, a Gaussian core and 
Lorentzian wings, with the central wavelengths fixed to- 
gether but allowed to vary slightly from the rest wavelength 
of C IV. We allow the strength and width of the two compo- 
nents to vary independently and fit to the red-wing taking 
care to avoid those regions of the spectrum contaminated by 
broad He n A1640 and [O in] A1663 emission. The measured 
width is taken to be the FWHM of the composite fit. If the 
measured FWHM<3500 km s _1 , then the object meets the 
criterion for the formation of an observable ghost feature. 

For our 21 ghost-candidates with z>2.15, only 12 (57%) 
have FWHM (C iv) < 3500 km s" 1 . In the remainder, those 
for which the C IV width can actually be measured (17 ob- 
jects), 8/17 (47%) also meet this criterion. This fraction is 
similar to that found for our comparison sample where 12 
out of the 20 objects (60%) for which C IV width measure- 
ments can be made, also meet this criterion (see Table 2 
for details). Taking the average of all of the FWHM mea- 
sured for the ghost candidates gives an average FWHM of 
3345±842 km s -1 wh ich is similar to the upper limit pro- 
posed by lAravl (|l996t ) and to the average FWHM of C IV 
found for the comparison sample (3534±1160 km s^ 1 ). A K- 
S test on these populations shows no significant difference 
between the ghost and comparison samples. The difficulty in 
fitting the C IV line in many of these objects results in large 
uncertainties in the estimated widths, however it is clear 
that several of our ghost candidates have emission t hat is 
wider than the 3500 km s _1 upper limit proposed by I Aravl 
( 1996). There are a number of possible explanations for this. 
Firstly, C IV may be a relatively poor surrogate for the width 
of Ly-a. Alternatively, our sample may suffer from contam- 
ination by objects in which the potential ghost feature is 
produced by a chance alignment of multiple absorption sys- 
tems or through other mechanisms unrelated to radiative 
acceleration. If the potential ghost-features are indeed due 
to line-locking, then it may in-fact be possible to produce 
an observable ghost feature from emission-lines with widths 
in excess of 3500 km s _1 . 
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4.4 Strong N v broad absorption line 

As described in §2, the ghost feature is produced by the 
increased radiation pressure on out-flowing N v ions by Ly-a 
photons. Since in this model the N v ions scatter a significant 
fraction of the incident flux, strong N v absorption is a pre- 
requisite. 

In order to measure the strength of absorption due to 
N v we use a modified Balnicity index (Weymann 1991). We 
take the continuum to be equal to the mean flux between 
AA1315-1330A, and require that the flux drops to below 90% 
of this value continuously for > 1000 km s _1 between and 
7000 km s _1 blue-ward of the N v emission line. We use the 
7000 km s _1 limit to avoid contamination by absorption due 
to Ly-a. Any object with a non-zero value for this measure 
is considered to show strong N v broad absorption. 

Significant N v absorption is measured in all 21 objects 
from our ghost candidate sample whose redshift allows de- 
tection of N v absorption. Only 44% (11/25) of our compar- 
ison sample satisfy this criterion. The geometric composites 
of Figure 7, confirm that our ghost-candidate sample dis- 
plays significantly stronger N v absorption on average than 
our comparison sample. Moreover, the broadly similar N v 
emission-line strengths of the ghost-candidate and compar- 
ison sample, emphasises the increased importance of N v 
absorption in the ghost-candidate spectra. 

4.5 Little far-UV flux 

Photo-ionisation models suggest that the far UV spectra of 
quasars contain a significant number of emission-lines be- 
tween A200 and A1000A that will contribute to the radia- 
tion pressure on any BAL outflow (Korista et al. 1993). In 
order for a ghost feature to be observed the radiation pres- 
sure of Ly-a on the N v ions in the outflow must be at 
least comparable in strength to the radiation pressure on 
all other ions in the outflow. This led Arav to suggest that 
the far-UV continuum must therefore be necessarily weak in 
objects showing ghost features. Absorption of far-UV pho- 
tons both within the host galaxy as well as within our own 
galaxy preclud e direct measurem ent of the far-UV contin- 
uum. However lArav et. al.l (119951 1 note that the strength of 
the He n A1640 A emission-line can be used as a surrogate 
for the far-UV ionising continuum. A strong He n line sug- 
gests a similarly strong far-UV flux and vice-versa as this 
line is produced by photons with energies corresponding to 
A228A(Ri54 eV). 

However, the He n emission-line strength is notoriously 
difficult to measure even in high quality spectra. This is 
because of its close proximity to the [O ill] A1663 emission- 
line, as well as its location near the red-wing of C IV, which 
may extend to relatively high velocities. Thus isolating the 
broad He n component generally requires multi-component 
fitting (see e.g. Goad and Koratkar 1998) which without 
prior information (for example, variability data), is rather 
subjective. 

Since the quality (in terms of S/N) of the individual 
spectra in both our ghost-candidate and comparison sam- 
ples is relatively low, we do not attempt to fit to the indi- 
vidual line components. Instead, we use our non-BALQSO 
composite spectrum as a template for the region of interest, 
and scale each spectrum to this template, by minimising 



the residual flux in the region of interest. Our optimisation 
routine fits for the underlying continuum in both spectra, 
and adjusts for both redshift errors and the scale factor over 
the region of interest (nominally AA1600-1680A). Since both 
spectra are first normalised to the flux in the continuum 
band between AA1700-1750A, if we assume that the He n 
and [O in] emission line strengths scale together, then the 
derived scaling factor between the ghost-candidate and non- 
BAL QSO spectra is a measure of the ratio of their He 11 
EWs. Figure [8] shows an example of a fit. In the top panel we 
show the non-BALQSO composite (solid line) and a ghost- 
candidate spectrum (dotted line). In the middle panel we 
show the spectra together with their continuum fits span- 
ning the C iv-He 11 region, while in the lower panel we show 
a close-up of the non-BALQSO composite and the scaled 
ghost-candidate spectrum after removal of the underlying 
continuum. The residual flux under the He ll-[0 ill] lines is 
indicated by the dot-dashed line. We caution that this fit- 
ting process is sensitive to (i) small redshift errors between 
the spectra, (ii) an appropriate choice of continuum bands 
(in general the choice of the continuum bands depend on 
the width of the C IV absorption trough, and most impor- 
tantly (iii) the S/N of the spectra. Consequently the error 
on the scale factor is generally quite large. We have visu- 
ally inspected the results of the fitting process for all of our 
ghost-candidate and comparison spectra to verify that the 
fitting procedure has converged correctly, adjusting the con- 
tinuum bands, and region over which the spectra are scaled 
when necessary. In Table 1 we list the He 11 scale factors, 
that is, the multiplication factor necessary to produce the 
same equivalent width in the ghost-candidate spectrum. A 
number less than 1 indicates that the ghost-candidate spec- 
trum has larger equivalent width in the line in the region 
of He 11 than the non-BALQSO spectrum. Conversely, num- 
bers greater than 1 indicate that the ghost feature has a 
proportionately weaker He 11 line. In order to determine er- 
ror estimates on the derived scale factors, after determining 
a best-fit solution, we fix all parameters apart from the scale 
factor, and then minimise on this parameter only. We then 
calculate the 90% confidence interval on this one interest- 
ing parameter from the best-fit model by varying the scale 
factor until the \ 2 value has increased by 2.71. Both the 
best-fit scale factor and its range (based on the 90% confi- 
dence interval), are given in Table 1 and 2. 

For our sample of 21 ghost-candidates for which all cri- 
teria can be tested, 5 indicate a significantly weaker He 11 
EW, suggesting a weaker than average UV continuum. Only 

5 objects show evidence for a stronger than average He 11 
EW. For the rest, there is no significant difference (within 
the errors) in their He 11 EWs when compared to the non- 
BALQSO composite. For the 22 ghost-candidates for which 
only some of the criteria can be tested, 5 show evidence for 
a weaker than average He 11 EWs, 6 indicate stronger than 
average He 11 EWs, and the rest indicate no significant differ- 
ence in the He 11 EW when compared to the non-BALQSO 
composite. Thus only 11/43 (25%) of our ghost-candidate 
sample appear to satisfy the criteria for a weak UV contin- 
uum. Repeating this test on our non-ghost BALQSO com- 
parison sample 8 objects indicate weaker than average He 11 
EW, 9 objects have stronger than average He 11 EW, and 

6 objects show no discernible difference in their He 11 EW 
relative to the non-BALQSO composite. For the remaining 
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Figure 8. An illustration of our method for estimating the He II 
strength. Upper panel - non-BALQSO composite (solid line) and 
an example ghost-candidate spectrum (dashed line). Middle panel 
— model fits to the underlying continua. Lower panel - non- 
BALQSO composite (solid line) and scaled ghost-candidate spec- 
trum, after subtraction of the underlying continuum. The residual 
flux is indicated by the dot-dashed line. 

2 objects, the fits did not converge, though visual inspec- 
tion of one of these shows no evidence for significant He II 
emission. 



4.6 Supplementary requirements - the width of 
the ghost feature 

If we ignore the likely complex interaction between the Ly- 
a photons and Nv ions (ie. we assume the optical depth in 
the flow is effectively constant with velocity), then naively 
one might expect that the ghost-feature should be at least as 
broad as the Ly-a emission-line responsible for its formation. 
We note that in fact scattering is likely dominated by the 
line core where the optical depth is larger. By requiring that 
any observed ghost feature is at least as broad as Ly-a, we 
can further refine our ghost-candidate sample to leave only 
the best ghost-candidates. While the width of any ghost- 
feature is relatively easy to measure (unless the S/N is low), 
we are once again limited by the accuracy to which we can 
measure the width of Ly-a, as this emission is absorbed in 
the process by which the ghost is formed. Here we again 
make use of the width of the C IV emission-line (see §4.3) 
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Figure 9. FWHM (km s 1 ) of the C IV emission- 
line and the ghost feature. The solid line shows 
FWHM(C iv)=FWHM(ghost). 

as a surrogate for the width of Ly-a. We isolate the ghost 
feature by removing the underlying absorption trough by 
fitting a simple function (cubic spline). We then fit a single 
Gaussian to the ghost feature to provide an estimate of its 
width. The measured widths are listed in Table 1. We find 
no correlation between the width of the ghost feature and 
the width of the C IV emission-line (Figure [5J. Instead, we 
find that in the vast majority of cases the ghost feature 
is narrower than the C IV emission-line (the solid line in 
Figurei]is FWHM(C iv)=FWHM(ghost). 

If we require that the FWHM of the ghost feature 
matches that of the C IV emission-line to within 1500 km s _1 
then 11/20 (for one object we are unable to make emission- 
line width measurements) of our ghost-candidates for which 
all ghost-formation criteria can be tested also match this 
criterion. For our remaining ghost-candidates 8/17 (5 ob- 
jects have no emission-line width information) also match 
this criterion. Of the 11 candidates with similar ghost and 
C IV widths for which all the criteria are testable all 11 show 
N v absorption, 10 show strong emission lines, 9 have nar- 
row C IV emission but only 2 show evidence for weaker than 
average He II emission. Of these two one has C IV FWHM 
of 4040 km s" 1 while the other fails the criteria for strong 
emission lines. 



5 DISCUSSION 

For the 43 objects in our ghost candidate sample, 21 are 
at large enough redshift to allow us to test all of the crite- 
ria necessary for the formation of an observable ghost fea- 
ture. Of these 21/21 satisfy the condition for strong N V 
absorption. 18/21 also satisfy the condition for strong in- 
trinsic Ly-a emission. However, less than half of the ob- 
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jects (12/21) satisfy the condition for narrow emission-lines, 
and even fewer (5/21) show weaker than average He II EW. 
While 11/21 objects satisfy the first 3 criteria, no single ob- 
ject meets all of the requirements for the formation of an 
observable ghost feature. For the remaining 22 objects, only 
3 objects pass 2 of the first 3 criteria, and of these all show 
stronger than average He II. Of the 5 objects which indicate 
weaker than average He II strengths, only one can be tested 
for any of the other criteria, in this case, the width of the 
emission-line, which it fails. 

None of the objects in our ghost-c andida t e sam ple 
are present in the samples produced by lAravl (|l99rj ) or 
iKorista et. ail ||l993h due to differences in the redshift ranges 
over which they were constructed. North et al. (2006) iden- 
tified 7 strong ghost-candidate spectra in DR3 of the SDSS. 
6 of those objects are also in our sample of 43 ghost- 
candidates. The other, SDSSJ142050. 34-002553.1. has an as- 
si gned redshift of 2.0 85 in DR5 compared to the 2.103 used 
bv lNorth et. alj |2006l 'l and is therefore rejected by our ghost 
zone cut. Only one of North et al.'s ghost zone final cut is of 
sufficient redshift to test all of the criteria for the formation 
of an observable ghost feature SDSS J110623. 52-004326.0. 
This object fails the narrow emission-line requirement. For 
the other 5 objects, 3 fail the test for narrow emission-lines, 
and 4 fail the test for weaker than average He n EW. 

For our non-ghost comparison sample 12/25 meet the 
criterion for strong N v absorption. 20/25 meet the require- 
ment of strong Ly a, while 12/22 meet the narrow emission- 
line width constraint. 9/25 objects show evidence for weaker 
than average He n strengths. However, only 3 objects meet 
3 out of 4 criteria, and none of these meet the weaker than 
average He II strength. Thus, none of the objects in our 
comparison sample meet all of the criteria necessary for the 
formation of an observable ghost feature. 

A comparison between our ghost-candidate and non- 
ghost samples suggests that the main difference between 
them lies in the strength of the absorption, with our ghost 
candidate sample displaying more objects with strong N V 
absorption. Comparison of the geometric mean composite 
spectra of these two samples, and their ratio (Figure 13 
dashed (ghost-candidate) and dotted (non-ghost comparison 
spectra) lines, indicates that the ghost-candidate composite 
shows significantly stronger absorption at lower velocities in 
all of the strong lines Ly-a, N v, Si iv and C iv. 

In order to test whether peaks are more common within 
the ghost-zone than elsewhere, we repeat the ghost selec- 
tion method using two "fake" ghost-zones (red-ward and 
blue- ward of the original ghost-zone), in a similar fashion 
to North et al. (2006). These zones are created in precisely 
the same way as the ghost-zone except that the red zone 
is centred at 4000 km s _1 while the blue zone is centred 
at 8000 km s _1 blue-ward of the C iv emission-line. Of the 
258 single peaked objects, 82 have peaks within the "fake" 
red zone and 50 have peaks within the fake "blue-zone". 
Since the original ghost-zone contained 69 objects with sin- 
gle peaks, the evidence for an excess of objects with peaks 
at a preferred velocity is weak. That is, single peaks within 
the ghost zone are no more likely than single peaks at other 
velocities. 

We have also examined the link, if any, between the 
peaks within the C iv absorption trough and N v absorp- 
tion. In order to select N v BALs we use a modified Balnicity 
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Figure 10. Composite spectrum made up of all our ghost candi- 
dates fit with a reddened DR5 composite(dotted) and a BALQSO 
composite (dashed). 

index (Weymann 1991). We take the continuum to be equal 
to the mean flux between AA1315-1330A, and require that 
the flux drops to below 90% of this value continuously for 
> 1000 km s _1 between and 7000 km s _1 blue- ward of the 
N v emission line. In order to perform this test we require 
objects with redshifts in excess of 2.15. From a sample of 
1747 objects with z>2.15, we find 1258 N v BALs and 489 
N v non-BALs. 27% (340) of the N v BALs show multiple 
troughs in their C iv absorption compared to only 15.5% 
(76) of the N v non-BALs. Similarly, 7.6% (95) of the N v 
BALs and only 3.1% (15) of the N v non-BALs show a single 
peak in the C iv absorption trough. Among the N v non- 
BALs we find : i) two objects with single peaks within the 
ghost-zone, ii) two objects with a single peak in the fake blue 
zone, and iii) no objects with a single peak in the fake red 
zone. For the N v BALs, we find : i) 26 objects with single 
peaks within the ghost-zone, ii) 14 with single peaks within 
the fake blue zone, and iii) 33 with single peaks within the 
fake red zone. While these results indicate a strong link be- 
tween the presence of N v absorption and the mechanism 
responsible for producing features within the C iv absorp- 
tion trough, line-locking between Ly-ct and NV does not 
appear to be the dominant mechanism. 

In summary, N v BALs are more likely to have multiple 
troughs within the C iv absorption than N v non-BALs (fac- 
tor of 2). Further, N v BALs are also more likely to display 
single-peaks within their C iv absorption than N v non- 
BALs. Approximately 25% of the single peaked objects are 
located within the ghost-zone. However, similar numbers are 
found in both the blue and red fake ghost zones. Thus while 
strong N v absorption appears to be a strong requirement 
for the appearance of features within the C iv absorption 
trough of BALQSOs, there is no preferred velocity for the 
location of these features. 
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Figure 11. Spectra of ghost candidates from the best-candidate sample for which all criteria can be tested. 
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Figure 12. Spectra of ghost candidates from the best-candidate sample for which all criteria can be tested. 
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ously proposed, or a large fraction of these objects are multi- 
trough interlopers masquerading as ghost-candidates. This 
possibility is supported by our finding that the 'fake' ghost 
zones bracketing the 'real' ghost zone contain comparable 
numbers of objects as the 'real' ghost zone. This suggests 
that the peaks observed within BAL troughs at the posi- 
tion of the ghost-zone are not a physical effect of radiative 
line driving, but are instead independent of any systematic 
velocity. 

Since our analysis has shed real doubt as to the ex- 
istence of Ly-a-N v line-locking features, in the attached 
appendix we repeat for the interested reader, the sequence 
of tests by Korista et al. (1993) and re-investigate the in- 
cidence of double-trough phenomena in a large sample of 
BALQSOs. In summary, the results of these tests show that 
there is no strong evidence for an excess of double troughs 
bracketing the region of ghost-formation, and that the evi- 
dence for line-locking between Ly-a photons and N v ions 
in BALQSO spectra is substantially lacking. 



Figure 13. Upper-panel - composite ghost-candidate (solid line) 
and comparison (dashed line) spectra. Lower panel - ratio of 
ghost-candidate to comparison spectra. 



6 SUMMARY OF RESULTS 

In an analysis of the largest sample of BALQSOs derived 
from DR5 of SDSS, we have identified 43 objects which show 
evidence for radiative driving, as indicated by the appear- 
ance of feature in the C IV absorption, and thought to be 
associated with the interaction between Ly-a photons and 
N v ions. We have attempted to explore the reality of these 
ghost-features by testing the physically motivated criteria, 
as set out by Arav, for the formation of observable features 
via this mechanism. Of the 21 objects for which we can test 
all of the criteria, none satisfy all of the conditions necessary 
for the formation of an observable feature. Two of the cri- 
teria, the width of the driving line, and the strength of the 
UV continuum are particularly difficult to measure. Of the 
5 objects that clearly pass the latter criteria, having weak 
He n, 4 have widths that are marginally larger than the 
upper limit imposed by Arav necessary to produce strong 
ghost signatures, while the other appears to have only weak 
emission-lines. We recognise that because of the large uncer- 
tainty on the measurements of the He II strength, 8 further 
objects from the sample of 21 could potentially satisfy all of 
the criteria necessary for ghost-formation. Addressing this 
issue will require higher S/N data than currently available 
for these objects. 

However, taken at face value either the conditions nec- 
essary for line-locking to occur are less stringent than previ- 
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Table 1. Our Ghost sample and the results of testing Arav's criteria along with the FWHM of the ghost feature 



* Objects identified by North et al 2006 as their best ghost-candidates, their Ghost Candidate Final Cut (GCFC). 

a Poor continuum fit. 

6 He ii absorbed. 

c He II too weak to be measured. 
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Table 2. As for Table 1, indicating our comparison sample of 25 objects selected to have broad absorption and no ghost-features and 
the results of testing Arav's criteria for ghost formation. 
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Figure 14. Ghost candidates for which only some of the criteria can be tested. 
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Figure 15. Ghost candidates for which only some of the criteria can be tested. 
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APPENDIX A: ON THE REALITY OF THE 
DOUBLE-TROUGH PHENOMENA 

Since the criteria by which observable ghosts are formed 
appears less stringent than has previously been claimed, we 
have performed a series of sanity checks to test whether 
ghost-features and double-troughs in general, are not simply 
a result of the random superposition of multiple absorption 
systems. The tests performed are similar to those described 
in Korista et al. (1993), though we note that our parent 
BALQSO sample is considerably larger and thus the quality 
of the statistics is a significant improvement over previous 
work. 

Al Random sampling 

In the first test we compute the geometric mean residual in- 
tensity spectrum (the ratio of the geometric mean BALQSO 
composite divided by a geometric mean non-BALQSO com- 
posite spectrum) from two random samples (without re- 
placement) taken from the BALQSO catalogue of Scaringi 
et al. (2009) (Figure Al, sample 1 (solid line), sample 2 
(dashed line). These samples (consisting of 1776 objects 
each) were generated by sorting the BALQSO catalogue into 
RA order, and then selecting even-numbered objects for the 
first sample, and odd-numbered objects for the second sam- 
ple. The mean residual intensity spectra for both samples 
are virtually indistinguishable, aside from a small difference 
(^15%) in the residual intensity in the velocity range 3900- 
7800 km s~ x blue-ward of line centre, which intriguingly, 
is precisely where the purported ghost feature should be 
found. We have checked to see whether there exists an ex- 
cess of ghost-candidate spectra in either sample, and find 
that the ghost candidates are spread evenly amongst both 
samples, with 21 in sample A and 22 in sample B. While 
there are small apparent differences in the mean residual 
intensity spectrum in the region of the ghost-feature, nei- 
ther sample shows convincing evidence of a double-trough 
feature. 

To explore these findings further, we have attempted 
to address whether there exists any substantive evidence 
for absorption components at preferred velocities. If ghost 
features are relatively common, we would expect an excess 
abundance of peaks within the absorption trough at veloc- 
ities 5900 km/s blue-ward of line centre. To address this 
question we have performed two additional tests. In these 
tests we work with fluxes instead of residual intensities to 
avoid complications introduced by uncertainties in fitting 
the C IV emission-line blue- ward of line centre. In the first, 
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at approximately 15,000 km/s. As with the previous test, 
varying the bin widths between 500-1500 km/s does not sig- 
nificantly alter the results. 

These additional tests suggest that for the general 
BALQSO population, there is no substantive evidence that 
there exist preferred velocities for the location of peaks 
within the broad absorption troughs of BALQSOs. Thus 
if ghosts are real they are certainly rare, as was first sug- 
gested by Arav. We note that the incidence of single peaks 
(or double troughs) in our sample 258/3552 (7%) is signifi- 
cantly smaller than that found by Korista et al. (1993, 22%), 
likely a result of the small number statistics of this earlier 
study. 



0.2 - 



o 
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Figure Al. Geometric mean composite residual intensity spec- 
trum of two randomly selected BALQSO samples (without re- 
placement) . 



we take the LVQ BALQSO sample of Scaringi et al. 2009, 
and compare the fluxes in 3 adjacent velocity bins (A, B 
and C) each with widths of 1000 km/s, where B is the ref- 
erence bin and A and C represent the adjacent bins. Sliding 
these bins from a velocity of 0-30,000 km/s blue-ward of 
Civ rest, we record the number of times the mean flux in 
the central bin (bin B) exceeds the mean flux in adjacent 
bins A and C by more than 3 a. To show the effects of 
selecting different bin- widths over which the fluxes are mea- 
sured, we repeat this exercise for bin widths of 500 and 1500 
km/s. The results of this procedure are shown in Fig A2 (up- 
per left panel). The distribution of peaks shows a prominent 
peak at zero velocity, which we associate with the location of 
the C IV emission-line peak. Ignoring this feature, the peak 
distribution is a generally smooth function of velocity with 
no indication of an excess number of peaks at 5900 km/s 
blue-ward of line centre. The same analysis performed on 
our ghost candidate sample, displays a similar peak at zero 
velocity, and in addition further peaks at around 6000 km/s 
and 11,000 km/s. While the numbers of peaks at higher ve- 
locities is only marginally less than that at 5900 km/s, visual 
inspection of our ghost candidate sample suggest that the 
significance of these secondary peaks is rather low. 

We have also performed a variation on this method in 
which instead of measuring the intensities within each bin, 
we measure the mean gradient of the intensities within each 
of the bins. A peak is recorded if and only if the gradient 
in bin A is positive, the gradient in bin A is greater than 
the gradient in bin B, the gradient in bin B is greater than 
gradient of C, and the gradient in bin C is negative. Fig- 
ure A2 lower panels shows the result of this test. Again, the 
peak at zero velocity we associate with the peak of the C IV 
emission-line. Thereafter the distribution of the locations of 
the peaks follows a smooth broad function with a maximum 
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Figure A2. Distribution functions showing the incidence of peaks as a function of velocity blue-ward of Civ rest wavelength (see text 
for details). Upper left panel - LVQ BALQSOs, peak distribution functions (solid line 500 km/s bins, dotted line 1000 km/s bins, dashed 
lines 1500 km/s bins. Upper right panel - as for upper left for the ghost-candidate sample. Lower panels , as above except now using 
slope comparisons. 
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Figure A3. Residual intensity spectra after aligning all spectra 
to the position of the first (lowest velocity) absorption trough. 
We see no indication of a second trough at the known separation 
of Ly-a-N v. 



Figure A4. The frequency distribution of velocities of the first 
absorption trough. 



A2 Appearance of double-troughs at other 
velocities 

In the second test we verify whether there exist associated 
troughs (via line-locking) at velocities other than systemic. 
We do this by aligning each spectrum according to the lo- 
cation of the red-edge of the first BAL trough and thereby 
search for evidence of a bi-modal distribution of sub-troughs. 
Here, the red-edge is defined as in Korista et al. (1993) to 
be where the residual intensity falls below 0.75 and remains 
below this level for more than 1000 km s _1 . The resultant 
geometric mean residual intensity spectrum is shown in Fig- 
ure A3. As one might expect, the residual intensity is some- 
what steeper at lower velocities, a result of aligning the first 
troughs in velocity space. However, the remainder of the 
spectrum is a smooth function of velocity, with no indica- 
tion of additional sub-troughs. Thus, as with Korista et al. 
(1993) we find no evidence for line-locking features at veloc- 
ities other than systemic. 

Unlike Korista et al. (1993) the distribution of veloc- 
ities of the first trough peaks at velocities between 1500- 
2000 km s _1 blue- ward of line-centre, thereafter the number 
of objects contributing to each velocity bin shows a smooth 
linear decline. Thus while there appears a preference for the 
onset of BALs toward lower velocities, there is no indication 
of other preferred blue- ward velocities. This again suggests 
that the incidence of the double-trough signature among 
the BALQSO population as a whole is generally low. We 
note that the distribution of redshifted troughs are similarly 
peaked toward lower velocities and may indicate a preferred 
launch radius for the out-flowing gas. 




-1x10* -5000 
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Figure A5. Residual intensity spectra of (i) the multi-trough 
sample, (ii) the single peak sample, and (iii) ghost-candidate sam- 
ple. See text for details 



A3 K-S test 

In our third test, we compute the mean residual inten- 
sities in 3 separate velocity bins, one centred on the 
expected location of any ghost-feature, the other taken 
from bins either side of this feature, ie. velocity bins at 
4000-5000 km s"\ 5500-6500 km s _1 (the reference bin), 
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Single Peak Sample 
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41.4 


Ghost Zone Sample 
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0.1 



Table Al. K-S tests on the multi-trough phenomenon 



and 10000-11000 km s _1 respectively. We then use the 
Kolmogorov-Smirnov (KS) test to check whether the dis- 
tribution of residual intensities within each of these velocity 
bins belongs to the same population as the reference bin. 
The residual intensity spectra we use for this test are (i) 
the total sample of BALQSOs from Scaringi et al. (2009), 
the randomly selected samples 1 and 2 described above, our 
Multi-trough sample from rejection cut 2, our single peak 
sample from rejection cut 3 and our candidate ghost sam- 
ple. The results of this test are presented in Table Al. 

Table Al gives the probability that the K-S statistic 
of the sub-trough with the reference trough (i.e. the ghost 
zone) would be exceeded by chance if they were drawn from 
the same population. That is, lower K-S probabilities indi- 
cate more significant differences between the chosen veloc- 
ity bin and the reference bin. Our analysis shows that only 
for our ghost sample do both sub-troughs 1 and 2 indicate 
significant differences from the reference bin (K-S probabil- 
ities less than 0.1%). This is as expected, since members of 
this sample were selected precisely because they showed sig- 
nificant ghost-like features. The single peak sample shows 
no evidence that the residual intensity of the ghost zone is 
drawn from a different population to that of sub-trough 2. 
This suggests that the single peak sample does not have a 
systematic peak and that the position of peaks within the 
absorption is random and unrelated to Ly-a-N v line lock- 
ing. 



A4 A Monte Carlo test for ghosts 

Korista et al. (1993) described one further test for an ex- 
cess of double-trough features bracketing the Ly-a-N v line- 
locking region, which we repeat here for completeness. In 
this Monte Carlo simulation, we test how frequently the 
depth of the double-trough structure is exceeded by a ran- 
dom arrangement of residual intensity differences among 
each of the 3 velocity bins (see Korista et al. 1993 for de- 
tails). For each spectrum, we can form 6 residual intensity 
differences from 3 velocity bins. For each spectrum in our 
BALQSO catalogue we randomly pick 2 from the 6 possible 
residual intensity differences and calculate their mean value, 
i.e. an average residual intensity difference. We then define 
< D > r andom to represent the mean of these average inten- 
sity differences across our BALQSO sample. < D > r andom 
is then compared to < D >, formed by averaging the mean 
residual intensity differences between sub-troughs 1 and 2 
with the reference bin, again averaged across our whole sam- 



BALQSO Sample 


< D > 


Monte Carlo 






Probability x 100% 


LVQ BALs 


-0.01197 


98.88 


MTS 


-0.03918 


99.8 


Single Peak Sample 


0.007355 


35.8 


Ghost Zone Sample 


0.16932 


0.0 



Table A2. Monte Carlo simulations of the double trough phe- 
nomenon 

pie. < D > can take on any value between ±1. Values of 

< D > near +1 indicate a strong double-trough structure. 
Conversely, a value of < D > near —1 indicates a strong 
trough at the position of the reference bin. Values near zero 
are indicative of a smooth trough passing through the ve- 
locity bins. 

To test how frequently a double-trough structure having 
a contrast at least as large as that observed in the sample 
mean could arise by chance, we repeat this process 10,000 
times for each sample, and determine the number of times 

< D >rcmdom exceeds < D >, normalised to 10,000. The 
results of these simulations are shown in Table A2. 

Since we only find strong evidence for the double-trough 
structure within our ghost zone sample (chosen for precisely 
that reason), we conclude that there is indeed no strong 
evidence for an excess of objects with sub-troughs bracket- 
ing the ghost-zone among the BALQSO population at large. 
Indeed, the high probabilities found for both the LVQ sam- 
ple and for the multi-trough sample, strongly suggest that 
multi-trough features are randomly distributed in velocity. 
We are therefore conclude that many of the objects previ- 
ously identified as strong ghost-candidates in previous stud- 
ies may simply be MT interlopers masquerading as ghosts 
due to the chance alignment of multiple un-associated ab- 
sorption systems. 

A5 SDSS J101056. 68+355833. 3: Si IV ghost? 

This is an object selected for our comparison sample as 
it shows no hint of a ghost feature within the deep broad 
C IV absorption. The spectra is shown in Figure IA6I along 
with the flux in velocity space relative to C IV and Si IV 
showing clearly the potential ghost feature in the Si IV ab- 
sorption. SDSS J101056. 68+355833. 3 meets the criteria for 
strong emission due to C IV and N v emission exceeding 
100% of the continuum flux at their peak, shows clear N v 
absorption, has C IV FWHM measured as 3483 km s _1 and 
an EW of He n measured to be 3.58±0.82 A. These measure- 
ments place this object very close to the boundary on several 
criteria and it would not be surprising to see a ghost feature 
in this object. While no ghost feature can be seen in the 
C IV trough, there is a clear feature within the Si IV trough 
(~6200km s _1 ) within the ghost zone. Due to the greater 
separation of the Si IV doublet(1933 km s -1 ) in comparison 
to the C IV doublet (498km s _1 ) a ghost feature in the Si IV 
would be expected to be wider and weaker. This larger dou- 
blet separation is the reason for the wider ghost zone in the 
S IV BAL trough. The absence of a C IV feature could be due 
to an extremely high optical depth in the flow such that the 
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Figure A6. SDSS J101056.68+355833.3: A possible Si iv ghost 
showing no evidence of a C IV feature along with close ups in 
velocity space of the C IV (middle) and Si IV (top) ghost regions. 
Vertical lines represent the ghost zone 



flow remains optically thick even with the decrease in optical 
depth caused by the acceleration due to N v ions. However 
the absorption troughs of C IV and Si IV look very similar 
with the exception of the potential ghost feature suggesting 
the optical depths are similar. There is a slight hint of an 
additional feature within the Si IV at ~8200km s _1 which is 
in the appropriate region to be due to the lower wavelength 
doublet feature from Si IV. 



